Analyzing and revising data integration schemas to improve their matchability

نویسندگان

  • Xiaoyong Chai
  • Mayssam Sayyadian
  • AnHai Doan
  • Arnon Rosenthal
  • Leonard J. Seligman
چکیده

Data integration systems often provide a uniform query interface, called a mediated schema, to a multitude of data sources. To answer user queries, such systems employ a set of semantic matches between the mediated schema and the data-source schemas. Finding such matches is well known to be difficult. Hence much work has focused on developing semi-automatic techniques to efficiently find the matches. In this paper we consider the complementary problem of improving the mediated schema, to make finding such matches easier. Specifically, a mediated schema S will typically be matched with many source schemas. Thus, can the developer of S analyze and revise S in a way that preserves S’s semantics, and yet makes it easier to match with in the future? In this paper we provide an affirmative answer to the above question, and outline a promising solution direction, called mSeer. Given a mediated schema S and a matching toolM , mSeer first computes a matchability score that quantifies how well S can be matched against using M . Next, mSeer uses this score to generate a matchability report that identifies the problems in matching S. Finally, mSeer addresses these problems by automatically suggesting changes to S (e.g., renaming an attribute, reformatting data values, etc.) that it believes will preserve the semantics of S and yet make it more amenable to matching. We present extensive experiments over several real-world domains that demonstrate the promise of the proposed approach.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analyzing and Revising Mediated Schemas to Improve Their Matchability

Data integration systems often provide a uniform interface, called a mediated schema, to a multitude of disparate data sources. To answer user queries posed over the mediated schema, such systems employ a set of semantic matches between this schema and the local schemas of the data sources. Finding such matches is well known to be difficult. Hence much work has focused on developing semi-automa...

متن کامل

Predicting Inefficient Problem-Solving Methods based on Early Maladaptive Schemas in Drug-Dependent Individuals

Objective: The aim of this study was to predict the inefficient problem-solving methods of drug-dependent individuals based on early maladaptive schemas. Method: This study was a descriptive-correlational research. The statistical population of the study included all drug-dependent individuals referred to addiction treatment camps in Qom in 2019. The statistical sample of the study was 270 drug...

متن کامل

The effectiveness of MCT, on maladjusted Schemas among divorced women

Abstract  Divorce and its effects on the family system in recent years increasingly been the focus of psychological research, in line to the researches, this research was done to evaluate effectiveness of metacognitive therapy on maladaptive schemas among divorced women. The method of research was quasi- experimental design, with pretest-posttest and control group. For sampling 30 women who ...

متن کامل

Learning Styles and the Writing Process in a Digitally Blended Environment: Revising, Switching, and Pausing Behaviors in Focus

The present investigation sought to explore the relationship between learning styles and writing behaviors of EFL learners in a blended environment. It also aimed to identify the learning style types best predicting writing behaviors. Initially, the participants' preferred learning styles were identified through the Kolb’s learning style inventory (Kolb, 1984). Secondly, data were obtained thro...

متن کامل

The Effectiveness of Emotionally Focoused Couple therapy on their Emotional Schemas of Young Couples

Introduction: The aim of this study was to the effectiveness of emotion- focoused couple therapy on the emotional schemas of young couples. Methods: The research method was semi-experimental to pretest-posttest. The statistical population consisted of all young couples with adjustment problems who referred to psychological service and counseling centers in Tehran in the second half of 2019 (N=...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • PVLDB

دوره 1  شماره 

صفحات  -

تاریخ انتشار 2008